Using Prior Information from the Medical Literature in GWAS of Oral Cancer Identifies Novel Susceptibility Variant on Chromosome 4 - the AdAPT Method
نویسندگان
چکیده
BACKGROUND Genome-wide association studies (GWAS) require large sample sizes to obtain adequate statistical power, but it may be possible to increase the power by incorporating complementary data. In this study we investigated the feasibility of automatically retrieving information from the medical literature and leveraging this information in GWAS. METHODS We developed a method that searches through PubMed abstracts for pre-assigned keywords and key concepts, and uses this information to assign prior probabilities of association for each single nucleotide polymorphism (SNP) with the phenotype of interest--the Adjusting Association Priors with Text (AdAPT) method. Association results from a GWAS can subsequently be ranked in the context of these priors using the Bayes False Discovery Probability (BFDP) framework. We initially tested AdAPT by comparing rankings of known susceptibility alleles in a previous lung cancer GWAS, and subsequently applied it in a two-phase GWAS of oral cancer. RESULTS Known lung cancer susceptibility SNPs were consistently ranked higher by AdAPT BFDPs than by p-values. In the oral cancer GWAS, we sought to replicate the top five SNPs as ranked by AdAPT BFDPs, of which rs991316, located in the ADH gene region of 4q23, displayed a statistically significant association with oral cancer risk in the replication phase (per-rare-allele log additive p-value [p(trend)] = 2.5×10(-3)). The combined OR for having one additional rare allele was 0.83 (95% CI: 0.76-0.90), and this association was independent of previously identified susceptibility SNPs that are associated with overall UADT cancer in this gene region. We also investigated if rs991316 was associated with other cancers of the upper aerodigestive tract (UADT), but no additional association signal was found. CONCLUSION This study highlights the potential utility of systematically incorporating prior knowledge from the medical literature in genome-wide analyses using the AdAPT methodology. AdAPT is available online (url: http://services.gate.ac.uk/lld/gwas/service/config).
منابع مشابه
Novel Bi-allelic PDE6C Variant Leads to Congenital Achromatopsia
Background: The clinical phenotyping of patients with achromatopsia harboring variants in phosphordiesterase 6C (PDE6C) has poorly been described in the literature. PDE6C encodes the catalytic subunit of the cone phosphodiesterase, which hydrolyzes the cyclic guanosine monophosphate that proceeds with the hyperpolarization of photoreceptor cell membranes, as the final step of the phototransduct...
متن کاملDirect Bisulfite Sequencing and Methylation Specific PCR to Detect Methylation of p15INK4b and F7 genes in Coronary Artery Disease Patients
Genome-Wide Association Studies (GWAS) have identified genetic variants contributing to the risk of cardiovascular disease (CVD) at the chromosome 9p21 locus. The chromosome 9p21 is an important susceptibility locus for several multifactorial diseases like ischemic stroke, aortic aneurysm, type 2 diabetes mellitus and coronary artery disease (CAD). F7 gene because of its role in activating the ...
متن کاملHomozygosity Mapping and Targeted Sanger Sequencing Identifies Three Novel CRB1 (Cumbs homologue 1) Mutations in Iranian Retinal Degeneration Families
Background: Inherited retinal diseases (IRDs) are a group of genetic disorders with high degrees of clinical, genetic and allelic heterogeneity. IRDs generally show progressive retinal cell death resulting in gradual vision loss. IRDs constitute a broad spectrum of disorders including retinitis pigmentosa and Leber congenital amaurosis. In this study, we performed genotyping studies to identify...
متن کاملGenetics of Type 2 Diabetes- A Review Article
Objective: Type 2 diabetes (T2D) as a complex disease is the result of genetically heterogeneous factors and environmental issues interaction. Linkage and small-scale candidate gene studies were successful in identification of genetic susceptibilities of monogenic form of diseases. However, they were largely unsuccessful while applying to the more common forms of disease. By designing Genome Wi...
متن کاملVSEAMS: a pipeline for variant set enrichment analysis using summary GWAS data identifies IKZF3, BATF and ESRRA as key transcription factors in type 1 diabetes
MOTIVATION Genome-wide association studies (GWAS) have identified many loci implicated in disease susceptibility. Integration of GWAS summary statistics (P-values) and functional genomic datasets should help to elucidate mechanisms. RESULTS We extended a non-parametric SNP set enrichment method to test for enrichment of GWAS signals in functionally defined loci to a situation where only GWAS ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 7 شماره
صفحات -
تاریخ انتشار 2012